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Convergence of spherical averages 
for actions of free groups 

By Alexander I. Bufetov 
1. Introduction 

Let (X, v) be a probability space and suppose a free group F m with m gen- 
erators acts on (X, v) by measure-preserving transformations. Let {a\, . . . , a m } 
be a set of free generators for F m and let Xi, . . . ,T m : X — > X be transfor- 
mations corresponding to the generators. Write T_j = 27" for i = 1, . . . , m, 
and set .A = {— m, . . . , —1, 1, . . . , m}. We also have the action F m on L±(X, u), 
defined by T g ip = ip o T g -i, g € F m . 

Consider the set of all finite words over the alphabet A: 

Wj, = {w = WlW 2 ...W n \Wi€ A}. 

Denote by the length of the word w and for any positive integer n, let 
W^(n) = {w £ Wa, \w\ = n}. 

For each w G Wa, w = w i . . . uu n , define a transformation 

(1) F w — T Wl T W2 . . . T Wn . 

Let IT be a stochastic 2m x 2m matrix, whose rows and columns are 
indexed by elements of A, that is, II = (pij),i,j £ A. Assume that II has a 
unique stationary distribution (p_ m , . . . ,p_i,pi, . . . ,p m ) and that Pi > for 
all i € A. 

For it; € W_a, w = W\ . . . w n , denote 

p(w) = p Wn w n - 1 Pw n - 1 w n --i ■ ■ •JWi, k{w) = p Wn p( w )- 
Consider the operators 

(2) ^ = E <w)T w . 

\w\=n 

In this paper, we investigate convergence of this sequence of operators. 

Definition 1. We shall say that the matrix IT generates the free group if 
Pij = is equivalent to i + j = 0. 



930 



ALEXANDER I. BUFETOV 



We shall need the symmetry condition 

PjPji 



(3) Pi = P-i, P-i-j 

Pi 

Relation (3) is equivalent to saying that all operators are self-adjoint. 
Let be the subgroup of words of even length in F m , that is, the sub- 
group generated by OjOj, i,j £ {1, . . . , m}. 

Recall that LlogL(X, v) = {if £ L\(X, v) : j x \ip\ log + \tp\dv < oo}. 

Theorem 1. Let (X, v) he a Lebesgue probability space. Assume the ma- 
trix IT generates the free group and satisfies (3). Then for any (p G L log L(X, u), 
the sequence s^ip converges as n — > oo both v- almost everywhere and in 
Li(X,v) to an F^- invariant function. 

Remark. The sequence s^ n+l ip also converges. The sequence need not 
converge, however, because the action of F m might have an eigenfunction with 
eigenvalue — 1, that is, a nonzero function ip € L±(X, u) such that Tiip = —ip 
for alH € A (for the same reason, the limit in Theorem 1 must be i^-invariant 
but need not be F m -invariant) . If the action does not have eigenfunctions with 
eigenvalue —1 then for any ip <G LlogL(X, v) the sequence s^ip converges as 
n — > oo both iz-almost everywhere and in Li(X, u) to an F m -invariant limit. 

Averages converge under weaker assumptions on the matrix IT than 
in Theorem 1. 

Definition 2. A matrix IT with nonnegative entries will be called irreducible 
if for some n > all entries of the matrix IT + IT 2 + . . . II n are positive (if IT is 
stochastic then this is equivalent to saying that in the corresponding Markov 
chain any state is attainable from any other state). 

Definition 3. A matrix IT with nonnegative entries will be called strictly 
irreducible if IT is irreducible and ITIT T is irreducible (here IT T stands for the 
transpose of IT.) 

Clearly, a matrix generating the free group is strictly irreducible. 

Theorem 2. Let (X,v) be a Lebesgue probability space and let p > 1. 
Assume the matrix II is strictly irreducible and satisfies (3). Then for any 
ip £ L p (X, u), the sequence s^ n ip converges as n — > oo both v- almost everywhere 
and in L p to an F^- invariant function. 



2. History 

First ergodic theorems for actions of arbitrary countable groups were ob- 
tained by V. I. Oseledets [17] in the following setting. 

Let T be a countable group that acts by measure-preserving transforma- 
tions of a probability space (X, is), and for g <G T let T g be the corresponding 
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transformation. Let /i be a probability measure on L satisfying the condition 
= fi(g). Let be the n-th convolution of fi. The ergodic theorem of 
Oseledets states that for 92 <G L log L(X,u), the averages 

A 2n <p = J2^ 2n \g)T g <p 
<?er 

converge almost everywhere. The proof is based on consideration of the self- 
adjoint Markov operator Q = X^er MsO^V 

In 1969 Y. Guivarc'h [9] (motivated by the work of Arnold and Krylov 
[1]) considered uniform spherical averages on the free group; that is, 

(4) Sn= 2m(2m-l)«- 1 J? Tg 

g:\g\=n 

and proved that for tp € L 2 {X : v) the sequence S2 n p> converges in L2 to an 
F^-invariant function. 

In 1986 R. I. Grigorchuk [6] (see also [7]) announced pointwise convergence 
for the averages 

^ N-l 

C N = -Y.Sn- 

n=0 

In 1994 Nevo and Stein proved: 

Theorem 3 (Nevo and Stein [15]). Letp > 1. Then for any ip G L P (X, u) 
the sequence s^nP converges as n — > 00 both v- almost everywhere and in L p to 
an F^- invariant function. 

The Nevo-Stein theorem is a particular case of Theorem 1; we shall how- 
ever consider it separately in Section 4 in order to illustrate the ideas of the 
proof of Theorem 1. 

3. The Markov operator 

Recall that if (Z, /i) is a probability space then a linear operator Q on 
Li(Z, fj,) is called a measure-preserving Markov operator if it preserves the 
cone of nonnegative functions, Li-norm, and Loo-norm. 

Let p = {p-m, ■ ■ ■ ,P~i,Pi, ■ ■ ■ ,Pm} be the stationary distribution of the 
matrix LI. 

Consider the space Y = X x A with the measure f] = v x p and a Markov 
operator P on L\(Y, rj) given by 

(5) Pip(x,i) = ^2p ij ip(T i x,j). 

jeA 

P is a measure- preserving Markov operator on Li(Y, if). It was introduced 
by R. I. Grigorchuk [7], J.-P. Thouvenot (oral communication), and myself [3]. 
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For n > 1 we have 

(6) P n ip(x,i) = Piw n - l p(w)Pw 1 MT w T i x,j), 

weW A (n-i),jeA 

which implies: 

PROPOSITION 1. Let ip e Li(X,u). Let (p e L\(Y, 77) be given by 
(p(x, a) = ip(x). Then 

To prove Theorem 1, it suffices to prove the following: 

Lemma 1. Suppose II generates the free group and satisfies the symmetry 
condition (3). Suppose the action of F 2 t on (X,u) is ergodic. Then for any 
if € LlogL(Y, V ), 

P n ip -» j ipdn 
Y 

both rj- almost everywhere and in Li{Y,rj). 
First we discuss ergodicity of P and P 2 . 

Lemma 2. If the action of F m on (X,u) is ergodic and H is strictly 
irreducible, then P is ergodic. 

Definition 4. A function (p € L±(Y, rj) does not depend on A if there exists 
ip € L\(X, u) such that ip(x, a) = ip(x) for all a £ A. 

Definition 5. A subset of A of Y will be called P-invariant if Pxa = XA 
(where \A stands for the characteristic function of A) . 

Ergodicity of a measure-preserving Markov operator is equivalent to the 
absence of nontrivial invariant subsets (see [20]). Lemma 2 follows now from: 

Proposition 2. Suppose that H is strictly irreducible. Then A C Y is 
P-invariant if and only if XA does not depend on A and is F m -invariant. 

Proof. If p k i > then XA{TkX,l) = XA{x,k). If (H T H)ij > then there 
exists k £ A such that p^ > 0, pkj > 0. Therefore, xa(x, k) = XA(TkX,i) = 
XA(TkX, j), which implies xa(x,i) = XA(x,j) and proves that xa does not 
depend on A. The equality XA(TiX,j) = xa(x,i), true when pij > 0, and the 
irreducibility of II imply group-invariance of XA- 

Lemma 3. Suppose that II is strictly irreducible and F^ acts ergodically 
on (X,u). Then the operator P 2 is ergodic. 
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By Lemma 2, P is ergodic. If P 2 is not ergodic, then there exists a 
nonconstant function ip G Li(Y, 77) such that = — V'- Arguing in the same 
way as in Proposition 2, we obtain that ^ does not depend on A, in other words, 
there exists <p G Li(X, u) such that ip(x,a) = p(x). The relation Pip = —ip 
implies Tup = —p for all i £ A, whence T g p = p for all g G F^, and the 
Lemma is proved. 

Remark. The Kakutani-Hopf ergodic theorem for Markov operators im- 
mediately implies that if the action of F m on (X, v) is ergodic then for any 
p G Li(X, v), 

1 f 

n=0 x 

both zv-almost everywhere and in L\(X, u) as N —>■ 00 (see [7], [3]). 
The operator adjoint to P is given by 

(7) /%-;,-. ;i X— •<•,/)• 

Consider a unitary operator [/ given by 

(8) Up(x,i) = p(TiX,-i). 
Clearly, U 2 = Id. 

Proposition 3. Suppose the matrix H satisfies the symmetry condition 
(3). Then P = UP*U. 

Indeed, using (3), we can write 

P*p{x,i) = ^p-i-jpiT-jxJ) = UPUp{x,i). 
jeA 



4. Uniform spherical averages 

In this section, we illustrate the method of the proof of Theorem 1, by 
deducing the Nevo-Stein theorem from Rota's "Alternierende Verfahren" the- 
orem [19] applied to the Markov operator (5). 

Consider uniform spherical averages (4) . They are a particular case of the 
averages s^ for II defined by pij = I /{2m — 1) for i + j ^ and pij = for 
i + j = 0. 

For IT thus defined, the Markov operator (5) takes the form 
(9) Pp(x, i) = E <f( T i x 'j) 
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and its adjoint is given by 

P*<p(x,i) = 1 <f(T-jX,j). 

Lemma 4. For P given by (9) and U given by (8), 

p*p= 2 ^up + — L-id. 

2m - 1 2m - 1 



Proof. We have 

f/ P</?(x, i) = Pip(TiX, -i) = 1 _ ^ <f(x, k ) 

lm 1 k:k+i 



and 



(2m — I) 2 

2m -2 ^ 1 
(2m — 1) ,"rl- 2m — 1 

v 7 k:kf=i 

2m — 2 1 

v 2m-l 2m -1 ' ; 

From Lemma 4 and Proposition 3, by induction, we obtain 

9m _ 9 1 

(10) (P*) n P n = — -UP 2n ~ l + rp*\n-lpn-l 

K J K ' 2m -I 2m- V 1 

or 

2m — 1 1 
p2n— 1 _ U( P* ) n P n tj ( p*\n—l pn—l 

2m -2 { ' 2m -2 1 ; 

The Nevo-Stein theorem easily follows now from the Alternierende Ver- 
fahren theorem of Gian-Carlo Rota [19]: 

Theorem 4 (Rota [19]). Let (Z, /i) be a probability space. Let Q be a meas- 
ure-preserving Markov operator on L\{Z,fj). Then for any <p € L log L(Z,/j,) 
the sequence (Q*) n Q n (p converges ^-almost everywhere and in L\ as n — > oo. 

Theorem 4 generalizes Stein's theorem [21] on convergence of powers of 
self-adjoint operators and easily follows from the Martingale convergence the- 
orem; we recall its proof in Section 6. Ornstein's counterexample [16] shows 
that neither Stein's nor Rota's theorem holds for ip £ L\. 

The equation (10) and Theorem 4 yield the convergence of P 2n ip for (p G 
L log L(Y,r)). Lemma 3 implies F^-invariance of the limit. The Nevo-Stein 
theorem is proved. 
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5. Proof of Lemma 1 

Lemma 5. Suppose H generates the free group and satisfies the symmetry 
condition (3). Then there exists a positive constant c depending only on IT such 
that for any nonnegative ip E Li(Y,n) and any n > 0, 

(11) (P*) n P> > d7P 2n ~y 



Proof. We first prove the statement for n = 1: 
(12) P*P(p > cUPip. 

Now, 

p*p<p{x,i) = ]T 

j,keA Pt 

If II generates the free group, then for any i, k <G A we have J2jeA ^^Pjk > 0- 
Since 

UP{x,i) = ^2p-i^{x,k), 
k 

(12) is proved; in view of Proposition 3, (11) follows by induction, and the 
lemma is proved. 

Now we prove Li-convergence of the powers P n . The following proposition 
is well known (see, for example, [11]). 

Proposition 4. Let Q be a measure-preserving Markov operator on a 
probability space (Z,fj,). Then the tail sigma-algebra of Q is trivial if and only 
if for any if £ Li(Z,fi), (Q*) n (f — > f z (fdfi in Li(Z,fi) as n — > oo. 

Since P = UP*U, triviality of the tail sigma-algebra of P is equivalent 
to the triviality of that of P*. To establish this triviality, we shall use the 
following version of the 0-2 law for Markov operators. 

Lemma 6. Let Q be an arbitrary measure-preserving Markov operator 
on a probability space (Z,/j,). 

If the tail sigma-algebra of Q is trivial then for any ip, ip G L2(Z, /i) 

J (Q*) T V • (Q*) n *Pd(i -> f<pdlij Wit 
z z z 

as n — > oo. 

// the tail sigma-algebra of Q is nontrivial then for any e > there exist 
positive functions (p,ip G L^Z^n) of integral 1 such that 

limsup / (Q*)> • (Q*) n iidfi < e. 

n^oo J 
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The proof of Lemma 6 closely models Vadim A. Kaimanovich's proof of 
the 0-2 law [11] and will be given in Section 6. 

Lemma 7. Under assumptions of Lemma 1, for any ip,ip € L2(Y,rj), 

/ P n ip ■ ipdt] — > / ipdrj / ipdr). 
Jy Jy JY 



This follows from the .fT-property for the operator P, which we prove in 
Section 7 (Lemma 10). 

Lemma 6, Lemma 7, and the inequality (11) easily imply triviality of the 
tail sigma-algebra of P. 

Indeed, for any positive ip, ip € L OQ (Y,rj), we have 

/ P n l p-P n ipdrj = f (P*) n P n ip-ipdrj>c f UP^^ip-ipdrj -» c / y?dr/ / V*? 
jy Jy jy Jy Jy 

as n — > oo. In view of Lemma 6, this relation implies that P* (and hence also 
P, since P = UP*U) has trivial tail sigma-algebra. 
Proposition 4 yields that for any ip G Li(F, ry), 



in L\ as n — > oo. 

Now we establish pointwise convergence of P n ip for 99 G L\ogL(Z, fx). 

Recall that if (Z, fx) is an arbitrary probability space then the Orlicz norm 
(see [24], [22]) on the space L\ogL(Z, fi) can be introduced, for example, by 
putting 

IMUlogL = mf{c : f M • log(M + 2)d/J < 1}. 
Jz c c 

Lemma 8. Let (Z, n) be a probability space and let Q be a measure- 
preserving Markov operator on Li(Z,fi). 

For any p > 1 there exists a constant A p > such that for any ip € 
L p (Z, /j,) we have 

\\sup(Q*) n Q n <p\\L p <A P M Lp . 

n 

There exists a constant A > such that for any ip G LlogL(Z, fi), 
||sup(QTQ>|| Ll <A|M| LlogL . 



Lemma 8 will be proved in Section 6. 
Lemmas 5, 8 yield: 
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Lemma 9. Let p > 1. Then there exists a constant p > 1 such that for 
any tp G L p (Y,rf), 

\\snpP 2n p\\ Lp <A p \\p\\ Lp . 

n 

There exists a constant A > such that for any tp £ L log L(y,rj), 

|| SUpP 2 ™^!!^ < ^IMIilogL- 

n 

Proposition 5. Let Q be a measure-preserving Markov operator on a 
probability space (Z,fi). If the tail sigma- algebra of Q* is trivial then for any 
(p € Li{Z, n) we have Q n p — > / p in L2 as n — > 00. 

The proof is given in Section 6. 

Now let p £ L 2 (Y,rj), fypdij = 0. Then ||P n (^|| L2 -> as n -»■ 00 by- 
Proposition 5. By Lemma 9, for any positive integer k, we have 

||supP 2 " +2 V||l 2 <^ 2 ||P 2 V||l 2 , 

n 

and the right part of the inequality tends to as k — > 00. This implies pointwise 
convergence of P 2n tp for p G L2(Y, 77), and, since we have Li -convergence for 
the whole sequence P n p, we also have pointwise convergence for P n p with 
^eL 2 (Y,7?). 

Since L2 is dense in LlogL, pointwise convergence of P n </? for <p E L2 
and the L log L-maximal inequality of Lemma 9 yield pointwise convergence of 
P n p for any tp G LlogL. 

To complete the proof of Lemma 1 and Theorem 1 , it only remains to prove 
Lemmas 6,7, 8 and Proposition 5. We do so in the following two sections. 



6. Proofs of Lemmas 6, 8 and of Proposition 5 

Let (Z, fj,) be a probability space and let Q be an arbitrary measure- 
preserving Markov operator on Li(Z,fi). Let 

Z z = {z = {z n ),neZ,z n eZ} 

be the space of bi-infinite sequences of elements of Z and let be the Markov 
measure on Z z corresponding to the operator Q and the stationary distribution 
li. Let gq be the shift on (Z Z ,Q^) given by (a Q (z)) n = (z) n+1 ; clearly, <jq 
preserves the measure Q^. 

For any k,m £ {— 00} UZU {00}, k < m, denote by J-JP 1 the sigma-algebra 
on Z z generated by the random variables zi, k < I < m. In particular, Tk is 
the sigma-algebra generated by Zk- We shall sometimes write F>k for JF£° and 
T< k for ^ fc 00 . 
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If <p G Li(Z,fi) and $ G Li(Z z ,Q M ) is given by 3>(z) = <p(zo), then 

£;($(z)|^) = QMz-n), and i?(^($(z)|^)|^o) = (Q*) n QW*o)- 

Rota's theorem (Theorem 4) and Lemma 8 immediately follow now from 
the inverted Martingale dominated convergence theorem and the corresponding 
maximal inequalities (see [13, Chap. IV, Props. 2-8, 2-10]). This argument 
implies, moreover, the following: 

Proposition 6. Suppose that the tail sigma- algebra of Q* is trivial. 
Then for all <p G LlogL(Z,(i), lim (Q*) n Q n tp = f tpdfi. 

n — >oo 

Proposition 6 implies Proposition 5, because if the tail sigma-algebra of a 
Markov operator is trivial then for any (p G L^Z, fx) satisfying f z ipd/i = 0, we 
have 

/ (QTipfdn = / (Q*) n Q> -<pdn^0 
Jz Jz 

as n — ► oo, by Proposition 6. 

Now we prove Lemma 6. The proof closely models Kaimanovich's proof 
of the 0-2 law for Markov operators [11]. 

The first part of the lemma is a corollary of Proposition 5. To prove the 
second part, let be the tail sigma-algebra of Q, that is, Too = ^k>oT> k , 
and assume there exists A G Too such that < Qfx(A) < 1. Set 

$(z) = X a(z)/Q m (A), vjr(z) = X(zZv4) ( z )/Q M (Z z \ A). 

Then ^ are positive, bounded, tail-measurable, / $dQ M = / ^dQ^ = 1, 
$ ■ * = 0. Let M be a constant such that M > M > Set <p k (z k ) = 
E(Q(z)\T< k ), ^k{ z k) = E(^(z)\^F<k)- Clearly, (p k ,ip k are positive and bounded 
from above by M. By the Martingale convergence theorem, <p k {z k ) ~^ &( z )i 
^k( z k) —> ^( z ) both Q M -almost everywhere and in Li(Z z , Q M ) as k — > oo. 
Since $, \I> are J^oo-measurable, we have 

E(<p k (z k )\Too) -> ^(^(^)l-^oo) * 

both Q M - almost everywhere and in Li(Z z , Q M ) as fc — > oo. 
Choose fe in such a way that 

J zZ E(tp k (z k )\Too)E(ip k (zk)\Too)dQ^ < e. 

Clearly, 

E(<p k (z k )\T> n+k ) = {Q*) n ip{z n+k ), E(ip k (z k )\T> n+k ) = {Q*) n ^{z n+k ). 
Therefore, as n — > oo, 



(<?*)"¥>*(*) • (Q*) n ^ fc (z)d/i = / , J B(^(^)|^>„ +fc ) J B(V'fc(^)|-F>„ + fe)ciQ M 



iz 

J zZ E(p k (z k )\J 7 oo)E(ii k (z k )\J : oo)dQ^ < e, 
and Lemma 6 is proved. 
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7. K-property and the proof of Lemma 7 

Let F z be the space of biinfinite sequences of elements of Y: 

Y z = {y:y = (y n ),neZ,y n eY}. 

Let be the measure corresponding to the operator P and the stationary 
distribution rj, and let op be the shift on (Y Z ,P V ). In order to prove Lemma 7, 
it suffices to show that up is mixing. To do so, we establish the following 

Lemma 10. Assume acts ergodically on (X, v) and assume the matrix 
IT is strictly irreducible. Then the system (Y z ,P v ,ap) has K-property. 

The proof is based on the Rohlin-Sinai theorem [18]. First, we give another 
realization of a p. 

Let £.4 be the space of bi-infinite sequences of symbols of A: 

£„4 = {uj : uj = (uj n ), neZ, u n G -4}. 

Let oa : — > be the shift on T,^. Let be the cr^-invariant Markov 
measure on corresponding to the matrix U and its stationary distribution p. 
Consider the map T : E.4 x X — > £,4 x X given by the formula 

(13) T(u,x) = (a A u,T Uo x). 

Clearly, the map T preserves the measure x v. 

Lemma 11. The systems (£^4 x X, /xn x v,T) and (Y z ,P v ,ap) are iso- 
morphic. 

Proof. Let y G Y % . Then y = (y n ), where y n G Y; that is, y n = (i n ,x n ), 
i n G A, x n G X. Set w(y) = (i n ), n G Z and x(y) = xr> The map F : Y z — > 
S.4 x X given by -F(y) = (o>(y), x(y)) produces the desired isomorphism (F is 
invertible because for - almost all y G Y z , we have x\ = Ti xo, X2 = T^xi, 
x-i = T- i _ 1 x Q , etc.) 

Now we establish the K-property for the system (£4 xl,/inx u, T). The 
proof follows the method of Oseledets [17]. 

As in last section, write J r ^ l (Y 1 ') for the cr-algebra in Y z generated by 
the random variables yi, k < I < m; write F™(S_4) for the cr-algebra in £^4 
generated by the random variables ui, k < I < m; write T>k instead of 
T<}~ instead of T^L^, and instead of J^f; finally, denote by B(X) the cr- 
algebra of all z^-measurable subsets of X, by B(Yi_a x X) the cr-algebra of all 
fiu x ^-measurable subsets of £4 x X. 

Let 7r(T) be the Pinsker cr-algebra of T. We shall use the Rohlin-Sinai 
theorem [18] to prove the triviality of 7r(T), and therefore the K-property 
of T. 
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Consider the cr-algebra G+= ^>o(^a) x (the future of our Markov 

process). Clearly, TQ + D Q + , and V k& T k Q + = B(E A x X); by the Rohlin- 
Sinai theorem [18], Q+ D tt(T). Let = .F<o(IU) x (the past of 

our Markov process). Clearly, T _1 C/_ D </_, and V keZ T k G^ = £(£4 x X). 
By the Rohlin-Sinai theorem, £/_ D 7r(T _1 ) = vr(T). We have, therefore, 
vr(T) C g+ A5- 

It is easy to check that (/xn x 1/)— almost surely we have 

(14) (T>o(Z A ) x B{X)) A (^<o(S^) x B(X)) = T (£ A ) x £(X). 

For k G Z, let = ^(S^) xB(X) (the moment A; of our Markov process). 
By (14), tt(T) C So- Since Tir{T) = vr(T) and T^o = Gk, 

7r(T) C A ke zGk- 

Now let <p : S_4 x X — > R be 7r(T)-measurable. Then for any G Z there 
exists ^ : 1" — > R such that (p(u,x) = i/}k(uk,x). 

Since for all A; G Z we have 2£(.E(</?|C7fc)|£7o) = V 3 ) we obtain 

(15) (P*) fe PVo = P fe (P*)Vo = V?o for all fc e N. 

To prove the triviality of n(T), it remains to prove that a function 990 : 
Y — > R, satisfying (15), is a constant. 

Proposition 7. Suppose II is strictly irreducible, ip G Li(Y, 77). T/ien a 
se£ ^4 is P* P- invariant if and only if XA does not depend on A. 

Indeed, 

P*Pxa(x, i) = Y^ ^^-PkiXA{x, I) 
k,i Pi 

and, for i, I fixed, we have J2k E] j ki Pki > if and only if (II T n)^ > 0, which 
implies the proposition. 

Lemma 12. Suppose the matrix U is strictly irreducible. Suppose a set 
A C Y is both P*P and (p*) 2 p 2 invariant. Then XA does not depend on A 
and is F^- invariant. 

By the previous proposition, xa does not depend on A. Write 

XA (x,i) = (P*) 2 P 2 XA(x,i) =J2 PjPjiPkPkj PkiPimXA(T l T- j x,m). 

Pi Pj 

We have, then, xa{x) = xa{TiT^jx) for all j,l such that (U T lT)ji > 0. Since 
the matrix II is strictly irreducible, the claim is proved. 

Lemma 10 is proved and it implies, in particular, that o~p is mixing, which 
yields Lemma 7. 

The proof of Theorem 1 is complete. 
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Remark 1. Let (Z,fi) be a Lebesgue probability space, Q a measure- 
preserving Markov operator on Li(Z,/j,), (Z Z ,Q M ) the space of trajectories 
of Q, oq the corresponding shift, and tt((Jq) the Pinsker sigma-algebra of oq. 
Then we have: 

Proposition 8. Note that tt(<tq) C J r (Z z ). If C C Z and the set 
{z : z G C} G 7t(ctq) £/ien xc = Q k (Q ) k XC = (Q*) k Q k XC for any fceN. 

The proof is the same as that of Lemma 10: first, the Rohlin-Sinai theorem 
gives that 7t(ctq) C T>q{Z' l ) nf<o(2 z ) = JF (Z Z ), then the <7Q-invariance of 
7t(<tq) implies that 7t(<tq) C AkezfkiZ' 1 ), which implies the proposition. 

Remark 2. Let [i be an arbitrary Borel probability a^-invariant measure 
on Clearly, the map T, defined by (13), preserves the measure /x x v. 

Let B = (Bij), i,j G A, be a — 1 matrix, and let /ibea Gibbs measure 
(in the sense of Bowen [2]) on the subshift of S_4 given by the matrix B. 

Arguing in the same way as in the proof of Lemma 10, we see that if B 
is strictly irreducible and the action of on X is ergodic, then the system 
(E_4 x X, n x v, T) has the if -property. 

8. Proof of Theorem 2 

Lemma 13. Suppose that IT satisfies (3) and that all entries of the matrix 
nn T + (nn T ) 2 + • • • + (nn T ) fc are positive. Then there exists a constant c > 
such that for any nonnegative if G Li(Y, 77), 

{{P*) n P n + (p*fpp*p n + ... + (P*) n (PP*) k P n )ip > cUP 2n - 1 ^ 
almost everywhere. 

The proof is the same as that of Lemma 5. 

Lemma 14. Let (Z, fx) be a probability space, let Q be a measure-preserving 
Markov operator on Li(Z,(i), let p > 1 and let k be a positive integer. Then 
for any ip G L p (X,u) the sequence Q n {Q*Q) k {Q*) n (p converges n~ almost ev- 
erywhere and in L p as n — > oo. 

Moreover, 

||sup(QT(Q*Q)*Q>|| Lp <A,|M| Lp . 

n 

Let ip G Li(Z,(i) and define $ G L^Z^, Q M ) by <J>(z) = (f(z Q ). Set 

<(z) = ^(z)ic) = (Q*r^) 

and for i > 1 let 

*\ l (z) = E(E($\r 1 (z)\r n - 1 )\r n ). 
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Clearly, 



KM = (Q*QT(QT^n) 



and 



The statement of the proposition follows now from Rota's theorem (The- 
orem 4) and the L p maximal inequality for martingales (see [13, Prop. IV-2-8]) 
by induction on i. 

In a similar fashion, Lemma 6 implies: 

Lemma 15. Let k be a nonnegative integer. If the tail sigma- algebra of 
Q is trivial then for any ip, ip € L<i(Z, \x) 



as n — > oo. 

// the tail sigma- algebra of Q is nontrivial then for any e > there exist 
positive functions ip, ip G L 00 (Z, of integral 1 such that 



The rest of the proof goes the same way as that of Theorem 1, with 
Lemma 14 being used instead of Lemma 8 and Lemma 15 assuming the role 
of Lemma 6. 



Theorem 2 can be applied to obtain spherical convergence for actions of 
some classes of Markov groups (in the sense of Gromov [8]). 

Let r be a Markov group. Its elements can then be coded by admissi- 
ble words in a topological Markov chain. Assume that the matrix A of the 
chain is irreducible and let LT be the matrix of the Parry measure (in other 
words, the measure of maximal entropy) corresponding to A. If II is strictly 
irreducible and satisfies the symmetry condition (3), then Theorem 2 is appli- 
cable. The spherical averages sjj 1 for II thus chosen can easily be reduced to 
uniform spherical averages in T (see [4]). Theorem 2 then yields convergence 
of uniform spherical averages for the group T. For example, this takes place 
for Vershik's locally finite groups [23]. 

Gromov [8] proved that Gromov hyperbolic groups are Markov. If the 
coding satisfied the assumptions of Theorem 2, then Theorem 2 would yield 
the following: 





9. A conjecture 
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Conjecture 1. Let T be a Gromov hyperbolic group, let S be a sym- 
metric set of generators, and denote by T 2 the subgroup generated by elements 
that have a geodesic representation of even length over the alphabet S. Let 
p > 1. Suppose T acts on a probability space (X, u) by measure-preserving 
transformations. 

Then for any <p £ L P (X, v) the sequence 

1 x , 

#{9 :\9\s = M g .^ =2n 

converges as n — > oo almost everywhere and in L p to a -invariant function. 

Assuming exponential mixing, Fujiwara and Nevo [5] obtained a conver- 
gence theorem for Cesaro averages of the spherical averages for Gromov hy- 
perbolic groups. 
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